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Abstract 



We present a novel approach to nonlinear constrained Tikhonov regularization from the viewpoint 
of optimization theory. A second-order sufficient optimality condition is suggested as a nonlinearity 
condition to handle the nonlinearity of the forward operator. The approach is exploited to derive 
, convergence rates results for a priori as well as a posteriori choice rules, e.g., discrepancy principle and 

' balancing principle, for selecting the regularization parameter. The idea is further illustrated on a 

' general class of parameter identification problems, for which (new) source and nonlinearity conditions 

, are derived and the structural property of the nonlinearity term is revealed. A number of examples 

including identifying distributed parameters in elliptic differential equations are presented. 
Keywords: second-order sufficient condition, nonlinearity condition, source condition, Tikhonov reg- 
ularization, nonlinear inverse problem, parameter identification 



^ 1 Introduction 

^ . . . 

l/^ , In this paper, we discuss a robust method for solving ill-posed nonlinear operator equations 

p; K{u)^g\ (1) 

' where £ H denotes the noisy data, with its accuracy relative to the exact data ~ K(u^) {u^ £ X vs. 

the exact solution) measured by the noise level 5 = ||.g^ — g'^H . Here the nonlinear operator K : X ^ H 
is Frechet differentiable, and the spaces X and H are Hilbert spaces. 

In practice, the unknown coefficient u may be subjected to pointwise constraint, e.g., u > c almost 
everywhere. This is especially true for distributed coefficient estimation in differential equations to ensure 
the well-definedness of the operator K, see, e.g., [T1[T31[3] for relevant examples. We denote the constraint 
' set hy C C X, and assume that it is closed and convex and G C. 

To obtain an accurate yet stable approximation, we employ the now classical approach of minimizing 
the following Tikhonov functional 

Mu)^^\\Kiu)-g'f + ^\\u\\^ 

where the two terms are the fidelity incorporating the information in the data g^ and a regularization for 
stabilizing the problem, respectively. A faithful choice of the fidelity depends on the statistics of noises 
corrupting the data [5]. The penalty is chosen to refiect a priori knowledge (such as smoothness) and 
constraint on the expected solutions, and nonsmooth penalties may also be adopted. 
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Consequently, we arrive at the following constrained optimization problem: 

mm{j,{u)^l\\Kiu)-g'f + ^l\\uf}, (2) 

given a regularization parameter 77 > 0. The minimizer of ([2]) is denoted by w^, and respectively, the 
minimizer for the exact data by u^. In practice, it is important to develop rules for determining 
the scalar parameter 77 > automatically so as to obtain robust yet accurate approximations ufj to the 
exact solution . We will analyze the discrepancy principle |26| , which uses a precise knowledge of the 
noise level 6 and determines rj by \\K{uf^) — g^\\ ~ 6, and two heuristic rules (balancing principle and 
Hanke-Raus rule), which do not require a knowledge of the noise level and are purely data-driven. 

Since the pioneering works [8l |29] , nonlinear Tikhonov regularization (in the absence of constraints) 
has been intensively studied [71 Chap. 10]. Various existence, stability and consistency results were 
established, and diverse practical applications have been successfully demonstrated. Also convergence 
rates results were derived for several choice rules, e.g., discrepancy principle and monotone error rule 
[4j[30]. The essential ingredients of convergence rates analysis are the source and nonlinearity conditions. 

To derive a convergence rates result, extra conditions on the exact solution are necessary [7]. They 
are collectively known as source conditions, and often impose certain smoothness assumptions on . 
In the absence of constraints, it is usually expressed via range inclusion, e.g., ~ K'{u'^)*w for some 
representer w £ H, or variational inequalities. In this work, we shall use an appropriate source condition 
in the presence of convex constraints, derived from the viewpoint of optimization theory as in We 
refer to [27l [23] for related results on constrained linear inverse problems. 

One also needs conditions on the operator K to control its degree of nonlinearity. One classical 
condition is that the derivative K'{u) of the operator K is Lipschitz continuous with its Lipschitz constant 
L satisfying L\\w\\ < 1 [2- We shall propose a second-order sufficient condition on the solution 
as an alternative. It is much weaker than the classical one, yet sufficient for analyzing the Tikhonov 
functional ([2]), i.e., establishing convergence rates results. The idea is further explored on a broad class 
of nonlinear parameter identifications, by exploiting explicit structures of the adjoint operator K'{u^)* 
and by revealing the structure of the crucial nonlinearity term {w, E{u, u^)) for bilinear problems. 

The rest of the paper is structured as follows. In Section [21 we motivate the source condition ([9]) 
and the nonlinearity condition (jlOp using optimization theory. In Section [3] we establish an a priori 
convergence rate of the discrepancy principle |26j and a posteriori convergence rates for two heuristic 
rules (balancing principle [151 [T4| and Hanke-Raus rule [10l[7]). In Section [4] we illustrate the approach 
on a general class of nonlinear parameter identification problems. Here we shall derive new source and 
nonlinearity conditions, and reveal the structural property of the nonlinearity term. Finally, the abstract 
theories are worked out in Section [5l for several concrete examples. An example is given, for which the 
smallness assumption (i||w|| < 1) in the classical nonlinearity condition is violated, whereas the proposed 
nonlinearity condition ([TO)) always holds. Also, detailed derivations are presented for three representative 
inverse coefficient problems. Throughout the paper, we shall use the symbol (•, •) to denote both inner 
products in Hilbert spaces and duality pairing, and the notation c to denote a generic constant which 
may change at different occurrences but does not depend on the quantities of interest. 

2 Second-order sufficient condition 

In this section, we develop the new approach from the viewpoint of optimization theory. We begin with a 
second-order necessary condition for the minimizer m^. Then we propose using a second-order sufficient 
condition as a nonlinearity condition, show its connection with classical conditions, and establish its role 
in deriving basic error estimates. 
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2.1 Necessary optimality system 

Consider the following generic constrained Tikhonov regularization formulation 

inui(j){u, g^) +riip{u), 

where the fidelity (f>{u,g) is differentiable in the first argument, the penalty tjj{u) is convex and (weakly) 
lower semi-continuous, and the constraint set C is convex and closed. Our derivation of the necessary 
optimality condition follows from |17| . 

Let Uri be a minimizer of the problem, i.e., 

(j){u,j,g^) + r]ip{ur,) < (j){v,g'^) + i]i^{v) Vw S C. 

Since ip is convex, for v = m,, + 1 (m — w,,) e C with uGC,0<i< 1 

<P{v,g^) - ipiur^.g^) ^ _^ - V'('«r,) 



t ~ t 

By letting i — > O"*", we obtain the necessary optimality condition 

(<^'K,5^),w-M^) - VK)) > VueC. (3) 
Now let dip{u) denote the subdifferential of the convex functional ip at u, i.e., 

d^piu) = e X* : > ^{u) + {^,u- u) Vu £ X}. 

Consequently, it follows from ([3]) and the convexity of ip that if there exists an element € dil)(uri) and 
let ^ 

then we have 

0'(u^,.g^) + 77^,, -77Ai,, = 0, 

(/i,,, M — u,j) > Vu e C, (4) 

Thus, /ir, €: X* serves as a Lagrange multiplier for the constraint C, cf. [Ml Thm. 3.2]. If tp'{uri) G X* 
exists, then = ip'{urj) and thus /i^ = ^(f)' {uri^g^) -\- tp' {ur^) . In a more general constrained optimization, 
the existence of /i,, e X* is guaranteed by the regular point condition [211 [T7]. The inequality (HI) is the 
first-order optimality condition. We refer to [231 [T7] for a general theory of second-order conditions (see 
also Lemma 2.1). 



2.2 Source and nonlinearity conditions 

Henceforth we focus on problem We will propose a new nonlinearity condition based on a second- 
order sufficient optimality condition. To this end, we first introduce the second-order error £'(u, u) of the 
operator K [17] defined by 

E{u,u) = K{u) - K{u) - K'{u){u ~ u). 

which quantitatively measures the degree of nonlinearity, or pointwise linearization error, of the operator 
if, and will be used in deriving our nonlinearity condition. We also recall the first-order necessary 
optimality condition for Utj (cf., Q) 

f K'{u,,)*(K{u,,) - .gt) + - Tjfi^ = 0, 
[ (/-t,,, u — Urj) > Vu e C, 

where is a Lagrange multiplier for the constraint C. In view of the difFcrentiability of the penalty, the 
Lagrange multiplier /x^ is explicitly given by u,j 4- iA''(u^)*(A'(w^) — g^). 

Now we can derive a second-order necessary optimality condition for problem ^ . 
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Lemma 2.1. The necessary optimality condition of a minimizer to the Tikhonov functional with 
the exact data is given by: for any u E C 

^\\K{u,,) - K{u)\\^ + §||u„ - uf + {K{u.,,) - g\E{u,u,,)) + l^{^l.,J,u - u,^) > 0, (6) 

where fin is a Lagrange multiplier associated with the constraint C . 

Proof. By the minimizing property of u,,, we have that for any u E C 

\\\K{u,) g^r + ^u,f< i||A» - /IP + ^u\\\ 

Straightforward computations show the foUowing two elementary identities 



1 IL, l|2 1 |L,||2 _ 1 ||„, „,||2 



K) - g^f - H - ff^f = -i||i^K) - K{u)f - (A'K) - g\K{u) - K{u^)). 
Upon substituting these two identities, we arrive at 

- \\\K{u,,) - K{u)\\' - %\\u,, ~uf- jf{u^, u~u^)~ {K{u^) - g\K{u) - K{u,,)) < 0. (7) 
Now, the optimality condition for the minimizer u,, (cf. ([S])) is given by 

K'{u,^)*{K{u^) - g^) + i]u,^ - rifir^ = 0, 
where firi is a Lagrange multiplier for the constraint C. Consequently, 

Assisted with this identity and the second-order error E(u,Uri), inequality ^ yields immediately the 
desired assertion. □ 

One salient feature of the optimality condition ([6]) is that it is true for any u Cz C and thus it is a global 
one. Also, the term (/i,, , w — u,,) is always nonnegative. The necessary condition (j6]) may be strengthened 
as follows: there exist some Cg e [0, 1) and e' > such that 

i||Ar(u^) - A'(u)ip + ^\\ujj - u|p + {K{u.r,) - g\E{u,Ur,)) + ri{iir,,u- Un) 

> f IIA'K) - K{u)f + ^\\u- Urf yu e C. 

That is, the left hand side of ([8]) is coercive in the sense that it is bounded below by the positive term 
^ljA'(u^) — K{u)\\^ + ^\\u — u,f. This condition is analogous to, but not identical with, the positive 
definitcness requirement on the Hessian in classical second-order conditions in optimization theory [241117] . 
Nonetheless, we shall call condition ([HI)/® a second-order necessary /sufficient optimality condition. 

Remark 2.1. The case of a general convex can be handled similarly using Bregman distance, which 
is defined by d^{u, u) ~ tp{u) — tpi^) — m — u) for any ^ G dip{u). Then repeating the proof in Lemma 
\2.1\ gives the following necessary optimality condition (^^ e di]j{un)) 

i||A'(w^) - A'(w)||^ -I- rid^^{u,Ur,) + {K{un) - g\E{u,Ur,)) + 7]{fir,,u- Un) > 0. 

All subsequent developments can be adapted to general penalty ijj by replacing ^\\u — u\\^ with d^{u,u). 
We refer interested readers to and references therein for properties of Bregman distance. 

Note that there always holds Ujj — > subscquentially as — 0^ 0- Assume that ^ ""^^"'''^ — !• w 
weakly and /j^ ^ /^^ weakly in suitable spaces as — > 0^. Then by taking limit in equation (O as 
— > 0"*", we arrive at the following source condition. 
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Condition There exists a w E H and fi^ E X* such that the exact solution satisfies 

r -K'{u^rw + u^ -fi^ = 0, 

\ {fi\u-u^) >0 VueC. ^ ' 

The source condition ^ is equivalent to assuming the existence of a Lagrange multiplier w (for the 
equality constraint K(u) ^ g^) for the minimum- norm problem 

min subject to K{u) = and u e C, 

and, hence, the source condition ^ represents a necessary optimality condition for the minimum-norm 
solution . We note that, in case of a linear operator K , there necessarily holds the relation: v) E Ji{K*), 
the closure of the range space IL{K*). The source condition is stronger since for ill-posed problems 
generally R{K*) ^ R(X*). 

Now we can introduce our nonlinearity condition based on a second-order sufficient condition. 

Condition There exists some e > and > such that the exact solution satisfies 

f ||x(w)-x(w^)||2 + i||ii-u^||2-(w,i;(w,ii^)) + (/i^M-u^) > f||u-u^||2 vmgc. (10) 

Here the elements w and are from the source condition The nonlinearity term {w, E{u,u^)) 
is motivated by the following observation. If the source representer w does satisfy w = lim^_i.o - — 
(weakly), then asymptotically, we may replace K{u,j)~g^ in (|5]) with —r/w, divide (|H]) by 77 and take 77 — > 
to obtain pO)) . upon assuming the convergence of to a finite constant Cr- We would like to point out 
that the constant c,. may be made very large to accommodate the nonlinearity of the operator K. The 
only possibly indefinite term is {w, E{u, u^)). Hence, the analysis of {w, E{u, w^)) is key to demonstrating 
the nonlinearity condition (|10p for concrete operator equations. 

Remark 2.2. On the nonlinearity condition (jlOp . we have the following two remarks. 

(1) In case of constrained Tikhonov regularization, we may have w = 0, which results in {w, E{u,u^)) = 
and thus the nonlinearity condition (jlOp automatically holds. For example, if C ^ {u : u > c}, 
with c being a positive constant, and u"^ ~ c is the exact solution {i.e., g^ = K{u^)), then w = and 

= satisfy the source condition ([9]). Moreover, if the set {ji^ 7^ 0} has a positive measure, then 
the term {n^ ,u — u^) provides a strictly positive contribution to (jlOp . These are possible beneficial 
consequences due to the presence of constraints. 

(2) A classical nonlinearity condition JSj] reads 

K'{u) is Lipschitz continuous with a Lipschitz constant L satisfying L\\w\\ < 1. (11) 

There are several other nonlinearity conditions. A very similar condition fllf is given by \\E[u, u)\\ < 
^\\u — u\\'^ with L\\w\\ < 1, which clearly implies condition (|10[) . Another popular condition pp. 
6, eq. (2.7)] reads 

\\E{u,u)\\ <ce\\K{u)-K{u)\\\\u-u\\. (12) 

It has been used for analyzing iterative regularization methods. Clearly, it implies (jlOp for Cr > 
{ce\\w\\)'^. We note that it implies ([5]) after applying Young's inequality. 

The following lemma shows that the proposed nonlinearity condition (jlOp is much weaker than the 
classical one, cf. ([TT|). Similarly one can show this for condition (|T2|) . Therefore, the proposed approach 
docs cover the classical results. 

Lemma 2.2. Condition (jlip implies condition pop . 
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Proof. A direct estimate shows that under condition (jlip . we have 

\{w,E{u,u''))\ < \\w\\\\E{u,u'')\\ < \\w\\ ■ M^lp 
by |]£'(u,w^)|l < -jllw — u^\^ from the Lipschitz continuity of the operator K'[u). Consequently, 
\\\u- u^f - {w,E{u, u^)) + {n\u - u^) >^\\u-u^f- ^L\\w\\\\u - u^f + {tx\u- u^) 

by noting the relation {ii\u — v)) > for any u G C. This shows that condition (fTO|) holds with 
e = 1 - > and 0. □ 

Remark 2.3. The condition ijluijl < 1 is used for bounding the nonlinearity term {w, E{ujj,u^)) from 
above. This is achieved by Cauchy-Schwarz inequality, and thus the estimate might be too pessimistic 
since in general {w^E(u,u'^)) can be either indefinite or negative. This might explain the effectiveness 
of Tikhonov regularization in practice even though assumption (jlip on the solution and the operator 
K{u) may be not verified. 

2.3 Basic error estimates 

We derive two basic error estimates under the source condition (j9|) and the nonlinearity condition ()10p : 
the approximation error — u'^W due to the use of regularization and the propagation error jju^ — w,, | 
due to the presence of data noises. These estimates are useful for analyzing convergence rates of some 
parameter selection rules as 77 — > 0+ [3 [3D] . 

Lemma 2.3. Assume that conditions ([9]) and pOp hold. Then the approximation error — satisfies 

\\u,-u^\\<e-i\\w\\^^= and \\K{u,) - g^\\ < -^^\\w\\. 

Moreover, if there exists some e' > independent of rj such that the second- order sufficient optimality 
condition ^ holds for all u (z C , then the propagation error — satisfies 



lliif, - < ^==-p and \\K{u\) - -ft:(it,,)| 



1 5 . 25 



< 



Proof. The minimizing property of the approximation w,, and the relation g^ = K{u^) imply 

i||i^K)-5^f + ill«.|p<fh^|p. 

The source condition ([9]) and Cauchy-Schwarz inequality give 

i|jA'K)-.gt||2 + |||,,^_yt||2 < _,^(„t^y^_^,t) 

= -?/(w, K'{u^){Ur^ - U^)) - ri{^\ur^ - v)) 

= —r]{w, K{un) - g^) + ri{w, E{u,j,u^)) - r]{fi\urf - u^) 
< r]\\w\\\\K{uj,) - g^W + t]{w, E{urj,u^)) - rjili' ,Ur, - u^). 

By appealing to the nonlinearity condition (jlOp . we arrive at 

l^lii^K) - g^r + ^\\u, «t||2 < ^\\^,\\\\K{u,) g^\\. 

Consequently, by ignoring the term — ""^ll^i "^c derive the estimate 

ll/^K)-5^|| <T^||HI, 
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and meanwhile, by invoking Young's inequality, we have 

i.e., — M^ll < ||i(;||-p^^=. This shows the first assertion. 

Next we turn to the propagation error — u^\\. We use the optimality of the minimizer wf^ to get 

\\\K{ul) a'f + -uX< hmu,) g'f rj{u„u',, u,) . (13) 

Upon substituting the optimality condition of li,, (cf. ([S])), i.e., 

Tyu,, = -K'{u,-i)*[K{u,-i) - g^) + tj^j.,,, 

into (|13p . we arrive at 

^WKiul) - + f - < i||7^K) - g'f + {K{u,) - g\K'{u,){u'^ - u,)) - rj{^i„ - u,) 
= i|jA-(w^) - g'r + - K{u,)r - {K{ul) - g'j<{u'^) - K{u,,)) 

+ {K{urj) - g\K'{u,j){u^^ - It,,)) - 77(/i,,, - w,,). 

Now the second-order error E{u^,Uji) and the Cauchy-Schwarz inequality yield 

< - (A'K) - /V «) - K{ur,)) + {K{u,,) - g\K'{u^){ul - u^)) 
= {g' - 9\K{ul) - K{ur,)) - {K{u,) - g\E{u%Ur,)) 
<\\g' -g^\\\\K{ut^)-K{u.,)\\ - {K{u,) - g\E{ul,u,)). 

Consequently, we have 

^\\K{ul) - K{u„)f + f - u,\\' + {K{u,) - g\E{u%u^))+ln{^l^,ul - u^) < S\\K{ul) - K{u,)\\. 
Finally, the second-order sufficient optimality condition ([5]) implies 

||A>^)-A-K)|| <f and \\uf^ - u^\\ < ^ j= . 

This completes the proof of the lemma. □ 

3 Applications to choice rules 

In this part, we illustrate the utility of the proposed nonlinearity condition (jlOp for analyzing selection 
rules, i.e., a priori rule, discrepancy principle [55], balancing principle ITni[T5l[T3] and Hanke-Raus rule 
[lOj . in deriving either a priori or a posteriori error estimates. The a posteriori error estimates seem new 
for nonlinear problems. Also a first consistency result is provided for the balancing principle. 

3.1 A priori parameter choice & discrepancy principle 

First, we give a convergence rates result for a priori parameter choices. 
Theorem 3.1. Under conditions ^ and (jlOp . we have the following estimates 
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Proof. In view of the optimality of the minimizer and the source condition 0, we have 

With the help of the second-order error E{u, u), we deduce 

< ^\\K{u^) - .g'f - v{w,Kiul) - K{u^)). 
Now from the nonhnearity condition (1101) and the Cauchy-Schwarz and Young's incquaUties, we obtain 

< l±^s' + v\\w\\ (!| A'(<) - g'W + y g'W) 

< l±^s' + ^ll^ll j| A^(,,^) _ + ^5||^„||, 

where we have made use of the inequahty 

Ili^K) - /V (^t)||2 < 2||A'(^f^) _ /||2 + 2||i^(ut) - g^f. 

Using Young's inequahty again and the fact that < + Pia, b,c> 0) imphes c < a + b gives 

H -uM\< e-i {^^S + ^^\\w\\ + ■ 
Meanwhile, by ignoring the term |||it^ — u^^||^, we deduce 

II^K) - a'W < (Vl + 2crvS + ^j^77||^«|| + v/MR) ■ 

This concludes the proof of the theorem. □ 

Therefore, the a priori choice rj 5 achieves a convergence rate C (5 5) and 0((5) for the error ||m^ — 
and for the residual \\K{u^^^) — g^^||, respectively, which coincide with that for the classical nonhnearity 
condition ([TT|) j8]. 

Next we illustrate the proposed approach for the popular discrepancy principle due to Morozov |26j . 
i.e., we determine an optimal parameter 77 by: for some constant c™ > 1 

\\K{ul)-g'\\^c^5. (14) 

The principle is very useful if a reliable estimate of the noise level 5 is available. The rationale is that 
the accuracy of the solution wf^ should be consistent with that of the data (in terms of residual). The 

consistency of the principle is well-known, and also it achieves a convergence rate 0[5^) under the classical 
nonhnearity condition ([TT|) [S] . The following result shows that the (weaker) nonhnearity condition ((TU| 
can reproduce the canonical convergence rate (D{5^). We would like to remark that the principle can be 
efhciently implemented by either the model function approach or quasi- Newton method [20| . 

Theorem 3.2. Let conditions © and (jlOp he fulfilled, and rf he determined by principle (jl4p . Then 
the solution uf^, satisfies the following estimate 

IK - < ^ (v/2(l + c„0||«;||<5^ + (1 + c,n)V^5) . 
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Proof. The minimizing property of uf , and the defining relation (fT4|) imply 



Upon utilizing the source condition ([9]) and the second-order error E{uf^t ,u^), we deduce 

Now the nonlinearity condition (|f Op yields 

f IK - ^t||2 < ||^„|||| - + ^^WK^u^,) - K{u^)f 

< {c,n + l)\\w\\6+f{l + c^f6^, 
where we have used the triangle inequality and (|14p as follows 

«) - K{u^)\\ < - /II + ||A>t) _ 9^11 < (1 + 

The desired estimate follows immediately from these inequalities. □ 

3.2 Two heuristic rules 

Next we apply the proposed approach to deriving a posteriori error estimates for two heuristic selection 
rules, i.e., balancing principle [l9l[14] and Hanke-Raus rule [lOllZl- These rules were originally developed 
for linear inverse problems and recently also for nonsmooth models |181 114j , but their nonlinear counter- 
parts have not been studied. The subsequent derivations rely crucially on Lemma [2?3l and thus invoke 
the second-order sufficient optimality condition ([8|). 

3.2.1 Balancing principle 

There are several rules known under the name balancing principle |15( Sect. 2.2]. The variant under 
consideration is due to [14], which has been successfully applied to a variety of contexts, including 
nonsmooth penalties |14j . It chooses an optimal regularization parameter rj* by minimizing 

ri = arg mm , 15 

where F{7j) = Jr]iufj) is the value function, and 7 > is a fixed constant. We refer to [14] for fine 
properties of the function F{ij). Generically, for the fidelity (j){u,g^) and penalty ip{u), it is equivalent to 
augmented Tikhonov functional J(u, A, r) recently derived from Bayesian paradigm fl9j 

J(u, A, r) = T(j){u,g^) + Ai/'(u) + /3oA — ao In A + /?it — ai Inr, 

which maximizes the posteriori probability density p{u,T, X\g^) oc p{g^\u,T)p{u,T, X) with the scalars r 
(noise precision) and A (prior precision) both having Gamma distributions. Here the parameter pairs 
(ao,/3o) and (ai,/3i) are closely related to the scale/shape parameters in the Gamma distributions. The 
approach determines the regularization parameter 77 by 77 = Ar~^, and in the case of /3o = /3i = 0, the 
selected parameter 77* satisfies rule (fTSj) with the free parameter 7 being fixed at the ratio ^ J^^. 

The name of the rule originates from the fact that the selected parameter rf automatically balances 
the penalty tp{u) = \\u\\^ with the fidelity (j){u,g^) = ||^'''(w) — .9^|P through the balancing relation 

77?lKlP = ||/i«)-.9'ir (16) 

This relation also shows clearly the role of the parameter 7 as a balancing weight. 
First, we give an a posteriori error estimate for the approximation u*. . 
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Theorem 3.3. Let the conditions in Lemma \2.S\ be fulfilled, rf he determined by rule (jlSp. and (5* 

||A'(ii^.) — g'^ll be the realized residual. Then the following estimate holds 



V F-^(r,")J 

Proof. By the triangle inequality, we have the error decomposition 

Wu^ -u'W < \\ufj - u,-i\\ + \\u,j - u^\\. 

It suffices to bound the approximation error — and the propagation error — u,, j|. For the 
former, we have from the source condition (|9]) (cf. the proof of Lemma [2. 3p that 

Consequently, we get 

-u^f - {w,E{u^,,u'f)) + - w^) < ||u;||||A'(u^-) - .9^11- 

However, by the triangle inequality and Lemma 12.31 the term \\K{ujj*) — can be estimated by 

\\KM-g^\ < \\KM-K{u^.)\\ + \\K{u'^,)-g'\\ + \\g'-g^\\ 
<^ + S* +S< max((5, S,). 

This together with the nonlinearity condition (jlOp yields 

f h,* - < M\\KM - 5^11 + f\\KM - , 

i.e., 

\\u^'-u^\\ < 7J (2^^^Hniax(5,<5,)5 + 2^y^max(<5,J,)) . 

Next we estimate the propagation error Wu^y* — u^i* \\. The minimizing property of the selected parameter 
rj* implies 

By Lemma 12.31 we have 

< 1 -5^i^max(<5,(5*)i 
The desired estimates follows from the choice c = max(2^^^^^^^^-\/j|w|| + ^^^^^ y/c^Taax{5, S^)^ , v^'''' 



We note that both 6 and are naturally bounded, so the constant c in Theorem 13.31 can be made 
independent of max((5, 5*). The estimate provides an a posteriori check of the selected parameter 77*: if 
the realized residual 6^ is far smaller than the expected noise level, then the prefactor {S)F ^ (^7*) 
might blow up, and hence, one should be very cautious about the reliability of the approximation x^. . 
Despite a posteriori error estimates for the principle, its consistency remains unaddressed, even for linear 
inverse problems. We make a first attempt to this issue. First, we show a result on the realized residual 

Lemma 3.1. Let the minimizer rj* = ri*{d) of rule (|15p be realized in (0, ||/\ Then there holds 

^0 as 6^0. 
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Proof. By virtue of |14[ Thm. 3.1], the balancing equation (jl6p is achieved at the local minimizer 77*. 
Consequently, it follows from and the optimality of the selected parameter rj* that 

[(1+ 7-^)11 A-(0-/|Pr+^ ^ F^+\rf) ^ F^+\ij) 
rj* V* ~ V ^ 

for any 77 G [0, ||i4r|p]. However, with the choice fj ^ 5 and by the optimality of the minimizer m^, we have 

F(.y)^i||A>^)-/|P + f||4lP 

<^+S\\um^^5. 

Hence, by noting the condition 7 > and the a priori bound rj* G [0, || A'|p], we deduce that the rightmost 
term in (|17p tends to zero as (5 — > 0. This shows the desired assertion. □ 

We can now state a consistency result. 

Theorem 3.4. Let there exist some M > such that \\u^\ < M, and the assumption in Lemma \3.1\ he 
fulfilled under the constraint C = {u ^ X : \\u\\ < M} . If the operator K is weakly closed and injective, 
then the sequence {w^.^^jja of solutions converges weakly to . 

Proof. Lemma l3.ll implies ||A'(w*.) — g^\\ ^ as 5 ^ 0. The a priori bound Hufj. || < M from the 
constraint C implies the existence of a subsequence of {uf^,}, also denoted by {u^,}, and some u* such 
that M^. — > u* weakly. However, the weak closedness of the operator K and weak lower semi-continuity of 
norms, yield ||A'(it*) — g^^j] = 0. Hence, K{u*) = g\ which together with the injectivity of the operator K 
implies u* = . Since every subsequence has a subsequence converging weakly to v) , the whole sequence 
converges weakly to it^ . This concludes the proof of the theorem. □ 

Hence, the balancing principle is consistent provided that there exists a known upper bound on the 
solution u\ which is often available from physical considerations. This provides partial justification of its 
promising empirical results |14| . In view of the uniform bound on the sequence {77* ((5)} in the defining 
relation (|15p. {ri*{5)} naturally contains a convergent subsequence. However, it remains unclear whether 
the (sub)sequence {rj*{5)} will also tend to zero as the noise level 5 vanishes. 

3.2.2 Hanke-Raus rule 

The Hanke-Raus rule [TUJ[7] is based on error estimation: the squared residual \\K{ul^) — divided by 
the regularization parameter 77 behaves like an estimate for the total error (cf. Theorem 13. ip . Hence, it 
chooses an optimal regularization parameter 77 by 

. HA-«)-/ip 
77 = arg mm ■ . (18) 

';e[o,l|A'P] ?7 

We have the following a posteriori error estimate for the rule ([T5|) . 

Theorem 3.5. Let the conditions in Lemma \2.S\ be fulfilled, rf be determined by rule (|18p . and ~ 
||A'(u^.) — g'^ll ^0 be the realized residual. Then for any small noise level 6, there holds 

||wf,- -u'^W <c(e-i\\w\\^ +e'~^j-'^ mSix{S,S*)i. 

Proof. As in the proof of Theorem 13. 3[ we deduce that the error ||it,,. — satisfies 

\\urr - u^W < ^ (2^v^max(^,'5,)* + ^^max{d,S. 
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Next we estimate the error — The definitions of 77* and (5* indicate 



If ?7 

for any fj G [0, jj/^jp]. By taking t) = 6 vo. the inequahty and noting Lemma |2.3[ we deduce 

<5:H-\\K{4)-g'r 

< 5~^5-' {\\Ki4) - K{us)\\ + \\Kius) - g^\\ + \\g^ - g'\\Y 

^ + t^II-I|)'m-. 

Using again Lemma 12.31 we arrive at the foUowing estimate 



After settingc = max(2^^^Sy]|tx^+^^^y(V max((5, 5*)^, --^{^^+ g ll^"ll))i the desired assertion 
foUows from these two estimates and the triangle inequahty. □ 



4 A class of nonlinear parameter identification problems 

Now we revisit the source condition © and nonhnearity condition pH]) for a general class of nonlinear 
parameter identification problems. The features of the source and nonhnearity conditions are illuminated 
by utilizing the specific structure of the adjoint operator K'{u^)* . Then we specialize to problems with 
bilinear structures, and show the unnecessity of the source representer w for (numerically) evaluating the 
nonhnearity term {w, E{u,u^)). Here we shall focus on derivations in an abstract setting, and refer to 
Section [5] for concrete examples. 

4.1 A general class of parameter identification problems 

Gcnerically, nonlinear parameter identification problems can be described by 

/ e{u,y) = 0, 
\ K{u) = Cy{u), 

where e{u, y) : X x Y ^ Y* denotes a (differential) operator which is differentiable with respect to both 
arguments u and y, and the derivative Cy is assumed to be invertible. The notation y{u) £ Y refers to 
the unique solution to the operator equation e(u, y) = for a given u, and the operator C is linear and 
bounded. Typically, the operator C represents an embedding or trace operator. 

To make the source condition ([9|) more precise and tangible, we compute the derivative K' [u)5u (with 
the help of the implicit function theorem) and the adjoint operator K'{u)* . Observe that the derivative 
y'{u)Su of the solution y{u) with respect to u in the direction du satisfies 

e„(u, y{u))5u + ey{u, y{u))y'{u)5u = 0, 

from which follows the derivative formula 

y'{u)5u = -{ey{u, y(u)))"^e„(u, y{u))6u. 
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Consequently, we arrive at the following explicit representation 

K'{u)5u = -C{ey{u,y{u)))~^eu{u,y{u))Su. 

Obviously, the adjoint operator K'{u)* is given by 

K'{u)*w = -eu{u,y{u))*{ey{u,y{u)))~*C*w. 

With the expression for the adjoint operator K'{u)* , the source condition i.e., K'{u^)*w = m^ — /i^, 
can be expressed more explicitly as 

-eu{u\y{u^)ney{u\y{u^)))-*C*w = - 

This identity remains valid by setting p = —{ey{u\y{u^)))^*C*w. In other words, instead of the source 
condition (|9]), we require the existence oi p G Y and p^ such that 

eu{u\y{u^)rp = u^ (19) 

and {p\u — u^) > for any u G C. This identity represents an alternative (new) source condition. A 
distinct feature of this approach is that potentially less regularity is imposed on p, instead of on w. This 
follows from the observation that the existence of p e y does not necessarily guarantee the existence of 
an element w G H satisfying p = —{ey{u\y{u^)))^*C*w due to possibly extra smoothing property of 
the operator {ey{u\y{u^)))~* , see Example 15.21 for an illustration. Conversely, the existence oi w always 
implies the existence of p satisfying the new source condition (|19p. Therefore, it opens an avenue to relax 
the regularity requirement of the source representer. Such a source condition underlies the main idea of 
the interesting approach in [9] for a parabolic inverse problem. 
Under the source condition pop . we have 

(w, E{u, u^)) = (w, Cy{u) - Cy{u^) - Cy'{u^){u - u^)) 
= {C*w, y{u) - y{v)) - y'{v)){u - v))) 

= {{ey{u\y{u^))y*C*w, ey{u\y{u'')){y{u) - y{u^) - y'{u^){u - u^))) 
= -{P,ey{u\y{u'^)){y{u) - y{v)) - y'{u^){u - v)))). 

Accordingly, the nonlinearity condition pO[) can be expressed by 

f ||i^(u) - K{u^W + {P, ey{u\y{u^)){y{u) - y{u^) y' {u^){u - u^))) 

Therefore, the term (p, ey{u\ y{u^)){y{u) — y{u^) — y'{u^){u~u^))) will play an important role in studying 
the degree of nonlinearity of the operator K, and in analyzing related Tikhonov regularization methods. 
We shall illustrate its usage in Example 15.41 We would like to point out that the nonlinearity condition 
((20)) can be regarded as the (weak) limit of the second-order sufficient condition ([8|) , which in the current 
context reads 

{ey{ujj,y{u^)y*C*{K{ujj) - g''),ey{u,,,y{u.^)){y{u) - y{u^) - y'{u,,){u - u^))) 

+ i||A'(u„) - K{u)f + ^u,j -uf + Vif^v^u- "r,> > t ll^(w^) - K{u)\f + ^\\u- u^f. 

The source condition (|19p together with the nonlinearity condition ([20]) can yield identical convergence 
rates for nonlinear Tikhonov models as conditions ([9]) and (jlOp , since conditions ([T9| and ([20| are exactly 
the representations of conditions ([9]) and ()10p in the context of parameter identifications. The main 
changes to the proofs are the following two key identities 

{w,K'{u^){u-u^)) = {p,e.^{u\y{u^)){u^u^)){^ {u^ - p^ ,u ^ u^)), 
{w, K{u) - K{u^)) = -{p, ey{u\y{u^)){y{u) - y{u^))), 
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and the remaining steps proceed identically. 

In the rest, we further specialize to the case where the operator equation e{u, y) ~ assumes the form 

Aiu)y-f = 0. 

A lot of parameter identification problems for linear partial differential equations (systems) can be cast 
into this abstract model, e.g., the second-order elliptic operator A{u)y = —S/ ■ {a{x)\i'y) + h{x)-\i'y + c{x)y 
with the parameter u being one or some combinations of a{x), h(x) and c{x). Then upon denoting the 
derivative of A(u) with respect to u by A'{u), we have 

eu{u,y{u))5u = A'{u)Suy{u) 

and 

ey{u,y{u)) = A{u). 

The derivative A' {u)Suy(u) can be either local (separable) or nonlocal. For example, in the former 
category, A{u)y ~ (~A + u)y with A' {u)Suy{u) = y{u)Su. The case A{u)y = —V • (uVy) with 
A'{u)Suy{u) = —V • {Su\7y{u)) belongs to the latter category. The local case will be further discussed in 
Section [4.21 Consequently, the (new) source and nonlinearity conditions respectively simplify to 

euiu^yiu"^))* p - p} 

and 

f ||A'(«) - K{u^)f + {p.A{u^)[y[u) - y[u^) - y'{u^){u - u^))) 

+ \\\u~u^f + {^i\u~u^) > ^\\u-u^f VueC. 

4.2 Bilinear problems 

Here we elaborate the structure of the crucial nonlinearity term {w, E{u, u^)) in the proposed nonlinearity 
condition (|10p . Interestingly, it admits a representation without resorting to the source representcr w for 
bilinear problems. Specifically, the following class of inverse problems is considered. Let the operator 
e{u,y) be (affine) bilinear with respect to the arguments u and y for fixed y and u, respectively, and for 
a given m, eu{u,y) is defined pointwise (local/separable). 

We begin with the second-order error E{u^u^) for bilinear problems. The bilinear structure of the 
operator e(w, y) implies 

= e{u, y{u)) - e{u\y{v))) 
= ey{u^ ,y{u^)){y{u) - y{u^)) + e„(u, y{u)){u - v)), 

i.e., 

y{u) -yiu') = ~{ey{u\y{v))))~'^eu{u,y{u)){u - u^). 
Therefore, we deduce that 

E(u, v)) = K{u) - K{v)) - K'(u^)(u - u^) 

= Cy{u) - Cy{u^) + C{ey{u\y{u^)))-' eu{u\y{u^)){u - u^), 

= -C{ey{u\y{u^))r^eu{u,y{u^)){u-u^) + C{ey{u\y{u^)))-^e^{u\y{u^)){u-u^) 
= -C{ey{u\y{v))))~'^{eu{u,y{u)) - eu{u\y{v)))){u ~ v)). 

With the help of the preceding three relations, the source condition K'{u^)*w — ~ p^ and locality 
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(separability) of eu{u,y{u)), we get 

{w,E{u,u'')) = {w, -C{ey{u\y{u^))y^ {eu{u,y{u)) - e„('u^ - u^)) 

-eu{u\y{v})) (ey{u\yiy))) C w, — — —r- {u~u^)) 

.w/ u* euiu,yiu)) - eu{u'< ,y{u^)) ^ 
A w' w, — — w-u') 



eu{u,y{u)) - _ |. 



eu{u\y{u^)) 

Therefore, we have arrived at the following concise representation 

/ p,/ tx\ / t t eu{u,y{u)) - eu{u\y{v))) ^ \ 

\ e„(wT,y(uT)) / 

A remarkable observation of the derivations is that the source representor w actually is not needed 
for evaluating {w, E{u,u^)) , which enables possible numerical verification of the nonlinearity condition 
pO)) . Note that even if we do know the exact solution , the representer w is still not directly accessible 
since the operator equation K'{u^)*iv = is generally also ill-posed. Hence, the representation ([2T|) is 
of much practical significance. Another important consequence is that it may enable estimates of type 
()12|) . thereby validating the nonlinearity condition pop . This can be achieved by applying Holder-type 
inequality if the image of K{u) and the coefficient u share the domain of definition, e.g., in recovering 
the potential/leading coefficient in an elliptic equation from distributed measurements in the domain, see 
Example 15.41 for an illustration. 

Finally, we point out that formally the representation (|21l) can be regarded as the limit of 

eu{u,y{u)) - eu{urf,y{u^)) , . 
" e (u viu )) ^" " ""^ 

as 77 goes to zero, which might be computationally amenable, and hence enable possible numerical veri- 
fication of the second-order sufficient condition ([8|) . 



5 Examples 

In this section, we illuminate the nonlinearity condition (|10p and the structure of the term {w, E{u,u^)) 
with examples, and discuss the usage of the source and nonlinearity conditions (fT9)) and ((20| . 

First, we give a one-dimensional example where the smallness assumption < 1) in the classical 

condition ([TT|) is violated while the proposed nonlinearity condition ([T0|) is always true. 

Example 5.1. Let the nonlinear operator A' : M — > M be given by 

K{u) = eu{\ — u), 

where e > 0. The solution depends sensitively on u if t is very small, hence it mimics the ill-posed behavior 
of inverse problems. Let the exact data {necessarily smaller than |) be given, then the minimum-norm 
solution is given by 

= i (1 - v/l - 4e-i.gt) . 

It is easy to verify that 
and 

E{u, v)) = K{u) - K{v)) - K'{u^){u - u^) 

= eu{l - It) - eu^l - u'f) - e(l - 2u^)(ii - u^) 
= -e{u-u^)\ 
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Now the source condition K'{u^)*w ~ implies that the source representer w is given by w = e(x-2-ut) ■ 
Therefore, the nonlinearity term {w, E{u,u^)) is given by 

which is smaller than zero for a fixed but sufficiently small g^ > 0. Moreover, the pref actor i^^^-f can be 
made arbitrarily large, thereby indicating that the smallness assumption {L\\w\\ < 1) can never be satisfied 
then. Actually, the Lipschitz constant L of K'iu) is L = It, and L\w\ = iyz^t-tj, which can be arbitrarily 

large {if is sufficiently close to ■^), and thus the classical nonlinearity condition (jlip is violated. This 
shows that the proposed nonlinearity condition pO[) is indeed much weaker than the classical one. 
A direct calculation shows in the second-order sufficient condition ([8]), 



{K{u^) -g\E{u,u,,)) = V ^ _r,^ {u-Uri) . 

Observe that the form of j^{K{ujj) — g\ E[u,Uri)) coincides with that of —{w, E{u,u^)) . With this explicit 
representation at hand, the nonnegativity of the term {K{u,j) — , E{u,u,j)) , and thus the second-order 
sufficient condition ([5]) , can be numerically verified for a given g^ and every possible rj since the Tikhonov 
minimizer u^i can be found by solving a cubic equation. 

Next we consider an elliptic parameter identification problem to show that the smallness assumption 
L||wj| < 1 of the classical nonlinearity condition (|lip is unnecessary by deriving an explicit representa- 
tion of the nonlinearity term {w, E{u,u^)) . The derivations also illustrate clearly structural properties 
developed in the abstract framework in Section 14.21 

Example 5.2 (Robin inverse problem). Let f2 C be an open bounded domain with a smooth boundary 
T, which consists of two disjoint parts Ti and Tc- We consider the following elliptic equation 

—Ay = in n, 

p^f on r„ (22) 

g^+uy = h onTi. 

We measure g ~ y on Tc and are interested in recovering the Robin coefficient u Cz C = {u : u > c} for 
som,e c > by means of Tikhonov regularization 

min / \y — g^ ds -\- rj i \u\'^ds. 

It arises in corrosion detection and analysis of quenching process \21f . Let y(u) G H^(fl) be the 
solution to (|22p . and jr^_ be the trace operator to the boundary Tc, similarly ^-Pi ■ Then direct computation 
shows K{u) = 7r^j/(ti). It is easy to show that the forward operator K : L'^{Ti) i— > L'^{Tc) is Frechet 
differentiable and the derivative is Lipschitz continuous. Moreover, straightforward computations give 

K'{u^)6u ~ 7r^z(u^), 
E{u,u^) = 7r,w(u,u^), 
K'{u^)*w = —"fr,{y{u^)z{u^)), 
where the functions z{u^), v{u,u^) and z{u^) £ II^{n) satisfy 

5uy{u)ids yieH^fl), 
{u - u''){y{u) - y{u''))vds Vi G H\n), 
vuids yijeH\n). 



Vvdx + 


/. 


v) zvds = 








Vvdx + 


/. 


u^vvds = 








Vvdx + 


/. 


v) zvds = 
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Assume that the source condition (jlOp holds with the representer w G L^{Tc). Then by setting v = z{u^) 
and V — v[u, u^) respectively in the their weak formulations, it follows that 



Hence, the term {w, E{u,u''^)) exhibits the desired structure, cf. (|2ip . Next, by maximum principle, the 
solution y(u) to (|22p is positive for positive f and h. Moreover there holds 

\\/{y{u)-y{u'f))\'^dx+ / u\y{u)-y{u'f)\^ds+ / {u - u^)y{u'f){y{u) ~ y{u^))ds 0. 



Note also the monotonicity relation, i.e., if u > u\ then y{u) < y{u^). It follows from the above two 
relations that if — ji^ > 0, then 

-{w,E{u,u^)) > 0. 

This shows that the nonlinearity condition (jlOp holds without resorting to the smallness condition L\\w\\ < 
1 in the classical nonlinearity condition (jlip under the designated circumstance. 

Next we contrast the source condition (|19p with the conventional one (|9]). The operator e{u,y) is 
bilinear, and eu{u,y{u)) = jYiyiu), eu{u,y(u))* p ~ jYiipyiu)). Hence the new source condition 
requires the existence of some element p G H^{il) such that 

lr,{py{u^)) =u^ - p^. 

This admits an easy interpretation: for jrtP to be fully determined, jTiyiu^) cannot vanish, which is 
exactly the identifiability condition (via Newton's law for convective heat transfer) 121^ . The representers 
p and w are related by {in weak form) 

/ Vp • Vwdx + / pvds = — / wvds Vv G i/^(ri). 

Jq JTi JTa 

This relation shows clearly the different regularity assumptions on p and w: the existence of p G H^{^) 
does not actually guarantee the existence of w G L'^{Tc). To ensure the existence of w G L'^{Tc), one 
necessarily needs higher regularity on p than H^(fl), presumably p G iJ^^Sl). Conversely, the existence 
of w G L'^{Tc) automatically ensures the existence of p G H^{il). 

Next we give an example of inverse medium scattering to show the same structure of the nonlinearity 
term {w, E{u,u^)) but with less definitiveness. 

Example 5.3 (Inverse scattering problem). Here we consider the two-dimensional time-harmonic inverse 
scattering problem of determining the index of refraction r? from near-field scattered field data, given one 
incident field y' fS^. Let y = y{x) denote the transverse mode wave and satisfy 

Ay + n^k^y = 0. 

Let the incident plane wave be = ^kx-d yjny^ ^ — {di,d2) G being the incident direction. Then for 
the complex coefficient u = (n^ — l)fc^ with its support within Vl C M'^, the total field y = y*"* satisfies 

y^y^+ / G{x,z)u{z)y{z)dz, 
Jn 

where G{x, z) is the free space fundamental solution, i.e., G{x, z) = ^H}^{k\x — z\), the Hankel function 
of the first kind and zeroth order. The inverse problem is to determine the refraction coefficient u from 
the scattered field 



y\x)^ / G{x,z)u{z)y{z)dz 
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measured on a near-field boundary T. Consequently, we have K(u) = jrv'^ix) G L'^{T). The Tikhonov 
approach for recovering u takes the form 

min / |A'(u) - /pds + 77 / \u\^dx. 

Here denotes the measured scattered field, and the constraint set C is taken to be C ~ {u £ L°°{Q) : 
5R(m) > 0,supp(u) CC fl}. It can be shown that the forward operator K : L^(f2) 1— >■ L'^(T) is Frechet 
differentiable, and the derivative is Lipschitz continuous on C. Now let G^{x,z) be the fundamental 
solution to the elliptic operator A + k"^ + . Then we can deduce 



)dz X £T, 

Jn 

E{u, u'^) = - I G^(x, z){u - u'^){y{u) - y{u''))dz xGT, 



K' {u'^Yw = —y[u^) J [x, z)w{x)dx, 
where ~ refers to taking complex conjugate. Then, by the source condition K'{u^)*w = — fi^ , we get 



{w,E{u,u^))L^^) ^ - / uj{x) G^x,z){u{z) -u^z)){y{u){z) -y{u^){z))dzdx 



v{u){z) ~ y{u'f){z) 



y{u^) / G'^ {x, z)w{x)dx{u{z) — u'^ [z)) ^ dz 

y{u^){z) 



Note that the structure of {w, E{u, u^))/,2(p) coincides with that in Examvle \5.2\ which further corroborates 
the theory for bilinear problems in Section \4-S\ However, an analogous argument for definitive sign is 
missing since the maximum principle does not hold for the Helmholtz equation. Nonetheless, one might 
still expect some norm estimate of the form ()12p . which remains open. In particular, then a small — jj) 
would imply the nonlinearity condition ([TU]). 

The last example shows the use of the source condition and nonlinearity condition (|20p . 

Example 5.4 (Inverse conductivity problem). Let Vl <Z he an open bounded domain with a smooth 
boundary T . We consider the following elliptic equation 

-V • {uVy) = f infl, 
y ~ on T. 

Let y{u) G Hq{^1) be the solution. We measure y {denoted by g^ G H^{Q)) in the domain ft with 
11^(5'^ ~ y{u'))\\L^(Ci) ^ i-S., K{u) = y{u), and are interested in recovering the conductivity u £ C = 
{u G H^ifl) : Co < u < Ci} for some finite cq,ci > by means of Tikhonov regularization 



min / \\/[K{u)~ g^)\^dx + ri ( \u\^ + \\/ u\^ dx . 



It arises in estimating permeability of underground flow and thermal conductivity in heat transfer fSlf . 
It follows from Meyers' theorem ]2&l that the operator K : H^{il) Hoi^) is Frechet differentiable and 
the derivative is Lipschitz continuous. The operator A{u) is given by —V • (uV-). It is easy to see that 

A'{u)Suy{u) = -V • {SuVy{u)), 
eu{u, y{u))*p = Wy{u) ■ Vp. 
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Consequently, the source condition (jl9p reads: there exists some p G HqIH) such that 

Vy{u^) ■Vp= {I - A)u^ - p\ 

which amounts to the solvability condition \/y{u^) 7^ (c/., e.g., \28l \16 f ). The nonlinearity condition 
(|20p is given by 

f II V(y(u) - y(u^))||i.(o) - {u^yp, ^Eiu, u^)) 

+ i||u - u^ll^i(n) + (m^" - u^) > f II"- "^Hhm^) Vii e C, 

where E{u, u'^) ~ K{u) — K{u^) — K'{u^){u — u^) is the second-order error. By setting v = p in the weak 
formulation of E{u,u^), i.e., 

/ u'^VE{u,u'f) -Vvdx = - / (ii-w^)V(2/(w) -?/(?/■'■)) -Vudx € H^{n). 
J Jn 

and applying the generalized Holder's inequality and Sobolev embedding theorem, we get 

\{u^Vp,VE{u,u^))\ < \\V{y{u) - y{u^))\\L2^n)\\u - u^\\LHn}\\yp\\Lp{n) 
< C||Vp|Up(o)||V(2;(u) - {y^))\\L^{n)\\u- u^Wmm, 

where the exponents p,q > 2 satisfy ^ + ^ = 5 {the exponent p can be any number greater than 2). 
Therefore, we have established condition (|12p for the inverse conductivity problem, and the nonlinearity 
condition (j20p holds provided that the source representer p € W^'^{^1) for some p > 2. We especially 
note that the smallness of the representer p is not required for the nonlinearity condition (j20p for this 
example. The convergence theory in Section 3 implies a convergence rate ||w^ — "^||_ffi(j2) ^ C^fS for the 
Tikhonov model with the a priori choice rj ^ 5 and the discrepancy principle. 

Note that the classical source condition ^ reads: there exists some w G i?o(ri) such that 

K'iu^Yw^ {I -A)u^ - p\ 

or equivalently in the weak formulation 

{VK'{u^)h,Ww) = {u\h)Hiin) - {l^^,h) Vh e H^n). 

This source condition is difficult to interpret due to the lack of an explicit characterization of the range 
of the adjoint operator K'{u^)*, as often is the case of parameter identifications 0/. Also the weak 
formulation of K' {u^)h G HQ{n), i.e., 

does not directly help due to subtle differences in the relevant bilinear forms. Nonetheless, the representers 
p and w are closely related by 

p= (yl(ut))-i(-A)u;. 

This relation indicates that the operator {A{u^))~^ {— A) renormalizes the standard inner product (V-, V-) 
on ffo(r2) to a problem-adapted weighted inner product (u^V-,V-), and thus facilitates the interpretation 
of the resulting source condition. This shows clearly the advantage of the source condition (|19p . 

6 Concluding remarks 

In the paper, we have presented a new approach to constrained nonhnear Tikhonov regularization on the 
basis of a second-order sufficient condition, which was suggested as an alternative nonhnearity condition. 
The proposed approach allows deriving convergence rates for several a priori and a posteriori parameter 
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choice rules, including discrepancy principle, balancing principle and Hanke-Raus rule, and thus it is useful 
in analyzing Tikhonov models. The structures of the source condition and nonlinearity condition were 
discussed for a general class of nonlinear parameter identification problems, especially more transparent 
source and nonlinearity conditions were derived. It was found that for bilinear problems, the source 
representer w in the crucial nonlinearity term {w, E{u,u^)) actually does not appear. The theory was 
illustrated in detail on several concrete examples, including three exemplary parameter identification 
problems for elliptic differential equations. It was shown that the proposed nonlinearity condition can 
be much weaker than the classical one, and the crucial term {w, E{u, u^)) can admit nice structures that 
are useful for deriving error estimates. 
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